WebVision Database: Visual Learning and Understanding from Web Data
نویسندگان
چکیده
In this paper, we present a study on learning visual recognition models from large scale noisy web data. We build a new database called WebVision, which contains more than 2.4million web images crawled from the Internet by using queries generated from the 1, 000 semantic concepts of the ILSVRC 2012 benchmark. Meta information along with those web images (e.g., title, description, tags, etc.) are also crawled. A validation set and test set containing human annotated images are also provided to facilitate algorithmic development. Based on our new database, we obtain a few interesting observations: 1) the noisy web images are sufficient for training a good deep CNN model for visual recognition; 2) the model learnt from our WebVision database exhibits comparable or even better generalization ability than the one trained from the ILSVRC 2012 dataset when being transferred to new datasets and tasks; 3) a domain adaptation issue (a.k.a., dataset bias) is observed, which means the dataset can be used as the largest benchmark dataset for visual domain adaptation. Our new WebVision database and relevant studies in this work would benefit the advance of learning state-of-the-art visual models with minimum supervision based on web data.
منابع مشابه
WebVision Challenge: Visual Learning and Understanding With Web Data
We present the 2017 WebVision Challenge, a public image recognition challenge designed for deep learning based on web images without instance-level human annotation. Following the spirit of previous vision challenges, such as ILSVRC [1], Places2 [2] and PASCAL VOC [3], which have played critical roles in the development of computer vision by contributing to the community with large scale annota...
متن کاملIdentification of the underlying factors affecting information seeking behavior of users interacting with the visual search option in EBSCO: a grounded theory study
Background and Aim: Information seeking is interactive behavior of searcher with information systems and this active interaction occurs in a real environment known as background or context. This study investigated the factors influencing the formation of layers of context and their impact on the interaction of the user with search option dialoge in EBSCO database. Method: Data from 28 semi-stru...
متن کاملEye-Tracking Method’ Usage for Understanding the Cognitive Processes in Multimedia Learning
Introduction: Designing multimedia learning environments should consist of the evidence-based study and principals about the human learning process. Eye tracking is a way based on the learner processing of learning materials which presented in multimedia learning environments. The aim of the study was to examine the use of the eye-tracking method to investigate the cognitive processes in m...
متن کاملInvestigating Healthcare Personnel’s Satisfaction with Quality of Web-based Learning in Teaching Preventive Behaviors of Hepatitis B Virus Infection
Introduction: Acceptance and implementation of preventive behaviors through new methods by healthcare personnel are of great importance. The aim of this study was to investigate healthcare personnel’s satisfaction with quality of web-based learning in teaching preventive behaviors of hepatitis B virus infection.Methods: This descriptive study was conducted on 120 healthcare employees in Tehran ...
متن کاملهمپوشانی سنتی و نسبی پایگاه های اطلاعاتی Scopus و Web of Sciences در حوزه بیماریهای غدد درونریز
Introduction: This study aimed to determine the traditional and relative overlap between Scopus and Web of Science databases in Endocrine System Diseases. Methods: This research is a descriptive survey and an applied study. Research population includes all articles retrieved from Scopus and Web of Science databases. 11 Descriptors and 120 sub-heading were searched in endocrine field in 2009....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1708.02862 شماره
صفحات -
تاریخ انتشار 2017